Short Running Title: Distributions of Surfers’ Paths through the WWW

Authors

  • Peter Pirolli
  • Peter L. T. Pirolli
  • James E. Pitkow
Abstract

Surfing the World Wide Web (WWW) involves traversing hyperlink connections among documents. The ability to predict surfing patterns could solve many problems facing producers and consumers of WWW content. We analyzed WWW server logs for a WWW site, collected over ten days, to compare different path reconstruction methods and to investigate how past surfing behavior predicts future surfing choices. Since log files do not explicitly contain user paths, various methods have evolved to reconstruct user paths. Session times, number of clicks per visit, and Levenshtein Distance analyses were performed to show the impact of various reconstruction methods. Different methods for measuring surfing patterns were also compared. Markov model approximations were used to model the probability of users choosing links conditional on past surfing paths. Information-theoretic (entropy) measurements suggest that information is gained by using longer paths to estimate the conditional probability of link choice given surf path. The improvements diminish, however, as one increases the length of path beyond one. Information-theoretic (Total Divergence to the Average entropy) measurements suggest that the conditional probabilities of link choice given surf path are more stable over time for shorter paths than longer paths. Direct examination of the accuracy of the conditional probability models in predicting test data also suggests that shorter paths yield more stable models and can be estimated reliably with less data than longer paths.
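As a rough illustration of the Markov approximation and the entropy measurement described in the abstract, the following minimal sketch estimates how uncertain the next link choice is, given the last k pages of a surfing path. It is not the authors' implementation: the function names `transition_counts` and `conditional_entropy` and the toy `paths` data are hypothetical, and real paths would first have to be reconstructed from server logs.

```python
from collections import Counter, defaultdict
from math import log2


def transition_counts(paths, k):
    """Count how often each page follows each length-k history (k-th order Markov)."""
    counts = defaultdict(Counter)
    for path in paths:
        for i in range(k, len(path)):
            history = tuple(path[i - k:i])  # the k pages visited just before path[i]
            counts[history][path[i]] += 1
    return counts


def conditional_entropy(counts):
    """Entropy in bits of the next page given the history, weighted by history frequency."""
    total = sum(sum(nxt.values()) for nxt in counts.values())
    entropy = 0.0
    for nxt in counts.values():
        n_history = sum(nxt.values())
        for n in nxt.values():
            # n / total is the joint probability of (history, next page);
            # n / n_history is the conditional probability of the next page.
            entropy -= (n / total) * log2(n / n_history)
    return entropy


# Hypothetical toy paths; each inner list is one reconstructed visit.
paths = [
    ["home", "products", "cart"],
    ["home", "products", "specs"],
    ["search", "products", "cart"],
    ["home", "about"],
]

for k in (1, 2):
    print(k, conditional_entropy(transition_counts(paths, k)))
```

On this toy data the conditional entropy is lower for k = 2 than for k = 1, which mirrors the abstract's point that longer paths carry additional information about link choice. Note also that the k = 2 estimate rests on fewer observed transitions, which is in line with the abstract's remark that longer paths need more data to estimate reliably.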

Similar resources

Dynamic Neighborhoods - Browsing the World Wide Web Together

We introduce an original scheme that turns the World Wide Web (WWW) into a social place. Our vicinity-based approach is to show the Web’s user space to the people surfing on the WWW. Being aware of other Web surfers, people browsing the Web may invoke synchronous communication by a single mouse-click – surfing the Web turns from a lonely affair into a joint experience. We present the basic theo...

The Effect of Prehabilitation on the Self-Reported Outcomes of Anterior Cruciate Ligament Reconstruction: A Systematic Review Running Title:

Background and Purpose: Quadriceps weakness and disruption of proprioceptive function are common after anterior cruciate ligament (ACL) injury and consequently the surgery. Postoperative self-reported outcomes are affected by the preoperative defect. The purpose of this review study was to examine whether preoperative exercises can affect self-reported outcomes. Methods: The study started sear...

Design and Evaluation of Improvement method on the Web Information Navigation - A Stochastic Search Approach

With the advent of the fast-growing Internet and World Wide Web (WWW), more and more companies are adopting electronic commerce to enhance their business competitiveness. At the same time, more and more people surf the Web to gather and process information. Because of unbalanced traffic and poorly organized information, users suffer from slow communication and disordered information organization. The ...

Pre-Fetching Web Pages Through Data Mining Based Prediction

The speed of fetching web pages for users is falling because of the rapid expansion of Internet use, the inherent delay in the network, and the request/response working mode of the WWW, and this is becoming a serious concern for web surfers. To speed up fetching web pages, this paper presents an intelligent web pre-fetching technique. We use a simplified WWW data model to...

NEMO: Next Career Move Prediction with Contextual Embedding

With increased globalization and labor mobility, human resource reallocation across firms, industries and regions has become the new norm in labor markets. The emergence of massive digital traces of such mobility offers a unique opportunity to understand labor mobility at an unprecedented scale and granularity. While most studies on labor mobility have largely focused on characterizing macro-le...

Publication date: 1999